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ABSTRACT : PROBLEM TO BE SOLVED: To convert an existed document inputted as picture data into 
a document described by prescribed structured tag language. 

SOLUTION: Picture data inputted from an input means 1 is layout-recognized by a layout 
recognition means 3 and an HTML layout generation means 6 generates the control code 
of HTML based on respective recognized layouts and HTML(hyper text mark up language) 
layout knowledge 5. A character recognition means 4 character-recognizes input picture 
data and outputs a character code. An HTML document generation means 20 generates 
the HTML document by associating the control code of HTML with the character code and 
outputs it from an output means 7. 
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